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The following is a complete listing of all the claims in the application, with an 
indication of the status of each: 

Claims 1-22 (Canceled). 



1 23. (Currently Amended) A method of outlier detection comprising: 

2 generating a plurality of synthesized data, each representing a randomly 

3 generated state within a given vector space, said generating including a random 

4 number generation; 

5 receiving a plurality of real sample data, each representing a detected real 

6 event as represented in said given vector space; 

7 forming a candidate sample set comprising a union of at least a part of 

8 said plurality of synthesized data and said plurality of real sample data, said 

9 candidate sample set having a starting population, said candidate sample set 

10 being unsupervised as to which members will be classified by said method as 

1 1 being outliers; 

1 2 generating a set of classifiers, each member of said set being a procedure 

13 or a representation for a function classifying an operand data as an outlier or a 

14 non-outlier, said generating a set of classifiers including: 

1 5 initializing said set of classifiers to be an empty set, 

16 selectively sampling said candidate sample data to form a learning 

17 data set, said selectively sampling includinq XX:XX 

1 8 i) applying said set of classifiers to each of said candidate 

19 sample data and, if any classifiers are extant in said set, generating 

20 a corresponding set of classification results, 

21 ii) identifying a consistency, for each of said candidate 

22 sample data, among said data's corresponding set of classification 

23 results, 
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24 iii) calculating an uncertainty value for each of said c a ndidate 

25 sample data based on said identified consistency of said data's 

26 corresponding set of classification results, 

27 ]vl W> calculating a sampling probability value for each of said 

28 candidate sample data based, at least in part, on the corresponding 

29 uncertainty value based on said set of classification results, and 

30 y) tit) sampling from said candidate data to form said learning 

31 data set based, at least in part, on said sampling probability values, 

32 such that said learning data set has a population substantieity lower 

33 than the starting population, 

34 generating another classifier based on said learning data set, 

35 updating said set of classifiers to include said another classifier, 

36 and 

37 repeating said selectively sampling, said generating another 

38 classifier, and said updating until said set of classifiers includes at least f 

39 members; and 

40 generating an outlier detection algorithm based, at least in part, on' at least 

41 one of said another classifiers, for classifying a datum as being an outlier or a 

42 non-outlier. 

1 24. (Currently Amended) The method of claim 23, further comprising: 

2 receiving a given mis-classifying mis charact o r i z i ng cost data associated 

3 with at least one of said synthesized samples, representing a cost of said outlier 

4 detection algorithm mis-classifying mis charact e rizing said at least one 

5 synthesized sample as a non-outlier. 

1 25. (Currently Amended) The method of claim 24, wherein said constructing said 

2 learning data set from said candidate sample data is further based on said mis- 

3 classifying mis charact e r i z i ng cost 
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1 26. (Currently Amended) The method of claim 23, wherein said ca l culat i ng a 

2 samp li ng probab ili ty va l u e i nc l ud e s i d e ntify i ng a cons i st e ncy within sa i d s e t of 

3 c l ass i f i cation r e sults and calculating a probability of said identified consistency 

4 uses using a Binomial probability function. 

1 27. (Currently Amended) The method of claim 23, wherein said calcu l at i ng a 

2 sampling probab il ity valu e includ e s i d e nt i fying a cons i st e ncy with i n sa i d s o t of 

3 classificat i on re s u l ts and calculating a probability of said identified consistency 

4 uses us i ng a Gaussian probability function. 

1 28. (Currently Amended) The method of claim 23, said mach i n e r e adab l e cod e 

2 fof generating synthesized data generates said synthesized data in accordance 

3 with a given statistical likelihood of said generated data meeting an outlier 

4 criterion. 

1 29. (Previously Presented) The method of claim 23, wherein said generating an 

2 outlier detection algorithm generates the outlier detection algorithm such (hat 

3 said algorithm applies an aggregate of members of said set of classification 

4 algorithms, calculates a corresponding set of detection result data representing 

5 each of said aggregate's member's classification, and applies a voting scheme to 

6 said corresponding set of detection result data. 

1 30. (Currently Amended) The method of claim 23, wherein said ca l cu l ating of 

2 sa i d unc e rtainty va l u e i nc l ud e s i d e nt i fy i ng a cons is t e ncy among sa i d s e t of 

3 classificat i on r e su l ts, and calculating a probability of said consistency assumes T 

4 assum i ng each classification result within said set has a 50-50 probability of 

5 representing an operand as meeting an outlier sa i d alarm criterion, statistically 

6 independent of said operand and of all other classification results within said set. 



4 



Docket No. YOR920030521 US1 



Application Serial No.: 10/749,518 



1 31 . (Currently Amended) The method of claim 30, wherein said calcu l at i ng a 

2 samp li ng probabi l ity v al u e furth e r i nc l ud e s calculating a probability of said 

3 identified consistency uses using a Binomial probability function. 

1 32. (Currently Amended) The method of claim 30, wherein said calculat i ng a 

2 samp li ng probab i l i ty va l u e furth e r inc l udes calculating a probability of said 

3 identified consistency uses us i ng a Gaussian probability function. 

1 33. (Currently Amended) The method of claim 28, wherein said calculat i ng of 

2 sa i d unc e rta i nty valu e i nc l ud e s i d e nt i fying a cons i st e ncy a mong sa i d 

3 class i ficat i on r e sults, and calculating a probability of said consistency , assum i ng 

4 assumes each classification result within said set has a 50-50 probability of 

5 representing an operand as meeting said outlier a la rm criteria, statistically 

6 independent of said operand and of all other classification results within said set. 

1 34. (Currently Amended) The method of claim 33, wherein said calcu l at i ng a 

2 samp li ng p r o babili ty v alue fu rther includes calculating a probability of sak: 

3 identified consistency uses us i ng a Binomial probability function. 

1 35. (Currently Amended) The method of claim 33, wherein said ca l cu l ating a 

2 sampling probab ili ty va l u e furth e r includ e s calculating a probability of said 

3 identified consistency uses us i ng a Gaussian function. 

1 36. (Currently Amended) A system for classifying externally detected samples as 

2 one of at least normal and an outlier, comprising: 

3 a machine controller having a readable storage medium; 

4 a machine-readable program code, stored on the machine-readable 

5 storage medium, having instructions to: 

6 generate a plurality of synthesized data, each representing a randomly 

7 generated state within a given vector space, said generating including generating 

8 a random number g e n e rat i on ; 



5 



Docket No. YOR920030521 US1 



Application Serial No.: 10/749,518 



9 receive a plurality of real sample data, each representing an observed 

10 event as represented in said given vector space; 

1 1 form a candidate sample set comprising a union of at least a part of said 

12 plurality of synthesized data and said plurality of real sample data, said candidate 

1 3 sample set having a starting population; 

14 generate a set of classifiers, each member of said set being a procedure 

15 or a representation for a function classifying an operand data as a rare ev e nt or 

16 an outlier or a non-outlier , said generating a set of classifiers including: 

1 7 initializing said set of classifiers to be an empty set, 

18 selectively sampling said candidate sample data to form a learning 

19 data set, said selectively sampling includinq XX:XX 

20 i) applying said set of classifiers to each of said candidate 

21 sample data and, if any classifiers are extant in said set, generating 

22 a corresponding set of classification results, and 

23 ii) identifying a consistency, for each of said candidate 

24 sample data, among said data's corresponding set of classification 

25 results, 

26 iii) calculating an uncertainty value for each of said candidate 

27 sample data based on said identified consistency of said data's 

28 corresponding set of classification results, 

29 iy} ii) calculating a sampling probability value for each of said 

30 candidate sample data based, at least in part, on the corresponding 

31 uncertainty value based on said set of classification results, and 

32 y] Ui) sampling from said candidate data to form said learning 

33 data set based, at least in part, on said sampling probability values, 

34 such that said learning data set has a population substant i a ll y lower 

35 than the combined first and second population, 

36 generating another classifier based on said learning data set, 

37 updating said set of classifiers to include said another classifier, 

38 and 
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39 repeating said selectively sampling, said generating another 

40 classifier, and said updating until said set of classifiers includes at least t 

41 members; and 

42 to generate an outlier detection algorithm based, at least in part, on 

43 at least one of said another classifiers, for classifying a datum as being an 

44 outlier or a non-outlier. 

1 37. (Currently Amended) The method of claim 36, wherein said machine- 

2 readable program code further comprises instructions for: 

3 receiving a given mis-classifying m i s charact e rizing cost data associated 

4 with at least one of said example abnormal state data, said cost data 

5 representing a cost of said outlier detection algorithm mis-classifying rois- 

6 charact e r i z i ng a data equal to said abnormal state data as not meeting an outlier 

7 sa i d a l arm criterion. 

1 38. (Currently Amended) The system of claim 36, wherein said machine-readable 

2 code instructions constructing said learning data set from said candidate sample 

3 data include instructions for constructing said learning data based, at least in 

4 part, on said mis-classifying m i s charact e r i zing cost. 

1 39. (Currently Amended) The system of claim 36, wherein said machine-readable 

2 code instructions for ca l cu l at i ng a sampling probab i l i ty va l u e i nc l ud e i nstruct i ons 

3 for i d e nt i fying a consist e ncy w i thin sa i d set of class i ficat i on r e su l ts and 

4 calculating a probability of said identified consistency include instructions for 

5 calculating said probability using a Binomial probability function. 

1 40. (Currently Amended) The system of claim 36, wherein said machine-readable 

2 code instructions for calcu l at i ng a samp l ing probab ili ty valu e i nc l ud e i nstructions 

3 for i d e ntifying a cons i stency w i thin sa i d s e t of c l ass i f i cat i on r e su l ts and 

4 calculating a probability of said identified consistency include instructions for 

5 calculating said probability using a Gaussian probability function. 
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1 41 . (Currently Amended) The system of claim 36, wherein said machine-readable 

2 code instructions for generating synthesized data include instruction for 

3 generating said data in accordance with a given statistical likelihood meeting an 

4 outlier criterion. 

1 42. (Previously Presented) The system of claim 36, wherein said machine- 

2 readable code instructions for generating the outlier detection algorithm include 

3 instructions such that said algorithm applies an aggregate of members of said set 

4 of classification algorithms, calculates a corresponding set of detection result 

5 data representing each of said aggregate's member's classification, and applies 

6 a voting scheme to said corresponding set of detection result data. 

1 43. (Currently Amended) The system of claim 36, wherein said machine-readable 

2 code instructions for calculating said uncertainty value include instructions fef 

3 i d e nt i fy i ng a cons i s te ncy among sa i d set of c l ass i ficat i on r e su l ts, and calcu l at i ng 

4 a probabi li ty of sa i d cons i st e ncy, assum i ng that assume each classification result 

5 within said set has a 50-50 probability of representing an operand as meeting an 

6 outlier sa i d a l arm criterion, statistically independent of said operand and of all 

7 other classification results within said set. 

1 44. (Currently Amended) The system of claim 36, wherein said machine-readable 

2 code instructions for calculating a sampl i ng probab ili ty va l u e i nc l ud e i nstructions 

3 for calculat i ng a probability of said identified consistency include instructions for 

4 calculating said probability using a Binomial probability function. 

1 45. (Currently Amended) The system of claim 36, wherein said machine-readable 

2 code instructions for calculating a samp li ng probab ili ty va l u e i nc l ud e i nstructions 

3 for ca l cu l ating a probability of said identified consistency include instructions for 

4 calculating said probability using a Gaussian probability function. 



8 



Docket No. YOR920030521US1 



Application Serial No.: 10/749,518 



1 46. (Currently Amended) The system of claim 41 , wherein said machine-readable 

2 code instructions for calculating of said uncertainty value include instructions fer- 

3 i d e nt i fying a consist e ncy among sa i d set of c l assification r e sults, and ca l cu l at i ng 

4 a probab i lity of sa i d consist e ncy, assuming that assume each classification result 

5 within said set has a 50-50 probability of representing an operand as meeting 

6 said outlier criterion alarm cr i t e r i a , statistically independent of said operand and 

7 of all other classification results within said set. 

1 47. (Currently Amended) The system of claim 45, wherein said machine-readable 

2 code instructions for calculating a samp li ng probab ili ty va l u e i nc l ud e i nstruct i ons 

3 for c al cu l at i ng a probability of said identified consistency include instructions for 

4 calculating said probability using a Binomial probability function. 

1 48. (Currently Amended) The system of claim 45, wherein said machine-readable 

2 code instructions for calculating a samp li ng probabi li ty va l u e inc l ud e i nstructions 

3 for c al cu la ting a probability of said identified consistency include instructions for 

4 calculating said probability using a Gaussian probability function. 
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